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The ability to distinguish recorded announcements from other call 
types is an essential part of the mechanized service-evaluation proc- 
ess. To do this, all announcements will have a special- information- 
tone prefix. A frequency detector — which is based on the correlation 
functions of the received signal — would be used to decide which 
announcement was triggered by a specific call attempt. This paper 
evaluates the performance of the frequency detector in the presence 
of additive noise and frequency shift: induced by the announcement 
machine. The theoretical results, based on a calibration frequency, 
are very encouraging. To verify that use of this frequency is feasible 
in practice, an algorithm is proposed and its performance evaluated 
to show that it compares favorably to the theoretical one. 

I. INTRODUCTION 

As part of the process of evaluating the end-to-end performance in 
the telephone network, a sample of call attempts is evaluated and the 
attempts are classified into several categories, such as completed, busy, 
recorded announcements, etc. To mechanize this classification process, 
a machine must have the ability to distinguish between completed 
calls and recorded announcements. Current planning for the mecha- 
nized systems envisions the use of special-information-tone (sit) pre- 
fixes which are to be attached to recorded announcements and can 
then be automatically recognized by the mechanized classifier (as well 
as alert the customer to the fact that he is listening to a recorded 
announcement). By choosing four distinct sits, each representing a 
certain category of recorded announcements, the classifier will have 
the ability to distinguish between these categories as well as to recog- 
nize a recorded announcement in general. 

The sit is defined as a sequence of three consecutive tones. To get 
four distinct sits five frequencies were chosen: f\ <f2<J\<h<h, and 
each sit consists of f, JJ, f 3 . The third tone has a fixed frequency and 
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will be used for calibration as is described later. The actual values of 
these frequencies are 904.5 Hz, 985.4 Hz, 1356.8 Hz, 1440.2 Hz, and 
1758.5 Hz. (The choice of these frequencies is mainly the result of 
constraints imposed by the ccitt definition of special information 
tones and tone generation and detection considerations.) To recognize 
which of the possible sits was received, the machine has to detect in 
the first tone whether it was /^ or ^ and in the second tone whether it 
was /Tor fi. This means that the classification process of the announce- 
ment categories is reduced to the two frequency-detection processes 
mentioned above. However, the planned direct recording of the sit 
followed by the recorded announcement on the various announcement 
machines in the telephone network may introduce significant degra- 
dation into the reproduced sit. In addition to additive noise, which is 
common to all signals in the network, frequency flutter and frequency 
shift of considerable effect on the reproduced tones may occur. The 
flutter effect, having an oscillatory nature, can be minimized by aver- 
aging properly the received data. In this paper, we report on our 
investigation of the combined effects of additive noise and frequency 
shift on the detection process. The detection scheme considered here 
involves use of correlation functions, and we have concluded that a 
reliable detection of sit frequencies is possible provided that a certain 
level of signal-to-noise ratio is ensured and the available data is 
properly used. It should be pointed out that in order to carry out the 
analysis we assume that the additive noise is white. We recognize, 
however, that in reality a colored additive noise may have a dominating 
effect on the error bounds. 

The structure of the paper is as follows. Section II presents the basic 
ideas of frequency detection via the correlation functions. In Section 
III, a white additive noise is considered to be present and the perform- 
ance of the detector in this case as a function of signal-to-noise ratio 
is evaluated. The effects of frequency shift are introduced in Section 
IV and the use of the calibration frequency to eliminate these effects 
is considered. The performance of the frequency detector as a function 
of the signal-to-noise ratio is evaluated and the results of using the 
calibration frequency are compared to the results when it is not used; 
a significant improvement is observed. 

The performance evaluations in Sections III and IV are based on 
using the likelihood-ratio test. This, because of the complexity of the 
expressions involved, is not practically feasible; however, these evalu- 
ations provide bounds on the performance of other schemes. In Section 
V, an algorithm for a detection process is presented. This algorithm is 
simple enough to be practical and its performance compares favorably 
to the theoretical one. 
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II. CORRELATION DETECTOR 

The use of correlation functions of a received signal for detection 
purposes was devised as part of the overall classification process by J. 
E. Walls of Bell Laboratories. 1 Some of the principles of the classifi- 
cation process that are relevant to our analysis are briefly described 
here. 

Let r{t) be a received signal observed for a period of length T. The 
correlation functions are defined by 



c =- 



T/2 



r(t)r(t + nA)dt n - 0, 1, 2 



(1) 



T/2 



where A < T (practically we want A to be small relative to T). 

We note that C is in fact the signal's average power in this time 
interval. 

Since we are interested in frequency detection, let us assume for a 
moment that the received signal is a pure frequency, namely 

r(t) = sin 2tt/* 

and derive expressions for the correlation functions in this case. Then, 



C -i 

U " rp 



T/2 



sin 27rft-sm 2irf{t + nh)dt, 



T/2 



or carrying out the simple integration results in 



C n = 



cos 27r/hA 



1 - 



sin 477/ T 
AirfT 



(2) 



The power term C and the next two terms C h Ci, are used for the 
detection process. Since 



*-i 



1 - 



sin Airft 



Airft 

Ci = Co cos 2tt/A 

C 2 = Co cos 2(2tt/A), 

if we normalize Ci and C 2 by the power Co we get the following 
relationship 



9i 



= 2 



C, 

Co 



- 1. 



(3) 



Expression (3) means that all pure frequencies correspond to a point 
on a parabola in the (Ci/C , C 2 /C ) plane. Clearly this parabola exists 
only for -1 < d/Co < 1 and -1 ^ C 2 /C < 1 (see Fig. 1) and G/C , 
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J — c '/ft 



f= 1356.8 Hz 



Fig. 1— Correspondence between various frequencies and the parabola on the (Ci/C , 
C2/C0) plane. 

C2/C0 are periodic functions of the frequency, so more than one 
frequency may correspond to the same point on the parabola. This 
means that for the detection process one must be concerned with 
the uniqueness of the points corresponding to the frequencies in- 
volved. For instance, if the two frequencies involved are f\ = Va A and 
/*2 = % A, both correspond to the point (0, —1) in the (C1/C0, C2/C0) 
plane. This means that computing Co, C h C 2 in this case is not sufficient 
to make the detection between these two frequencies possible even in 
the deterministic case (absence of noise). 

However, the choice of the parameter 1/A for a given set of frequen- 
cies involved in the detection process can ensure the necessary unique- 
ness. One way of doing it is to choose 1/A to be larger than twice the 
largest frequency to be detected, and this will be sufficient, as can be 
seen in Fig. 1. 

In the presence of noise, as is shown later, the point will shift from 
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the parabola towards the origin, while the origin itself corresponds to 
white noise for which both G and d are equal to zero. 

In practice, instead of integration, the received signal is sampled and 
the sampled values are then used to get a good approximation of the 
correlation functions. Let r(t) again denote the received signal, sam- 
pled with frequency F s = 1/A, and N the number of samples done in 
the observation interval T. Then eq. (1) will be replaced by 

i N-n-i 

Cn = -T T £ r,r,+ n , (4) 



N ,f 



where 



n = r 



i 



Again, for a pure frequency, after some manipulations it can be 
shown that 

1 """- 1 / / 

Cn = -r- T S sin2ir-£-isin2w-£-(i + ?i) 

iV ,-o i*s *a 



can be written as 



n l of 

C n = - cos n 2tt — 

2 F a 



. sin N 2tt — 

sin 2-tt -— 
F s J 



cos N 2-tt — sin n 2tr — 

+ In cos(N - 1)2tt L '-— -. (5) 

2 r s _ / 

sin 2-n — 



With the assumption that N is large compared to n = 0, 1, 2, we may 
write 



n 1 o f 
C n = -r cos 2tt — n 

2 r s 



sin N 2tt y 
l-Icos(N-l)2^ _1 

sin 2tt — 

t s . 



(6) 



so that the relationship eq. (3) holds here as well. In the Appendix, the 
values of the correlation functions computed by eqs. (1) and (6) are 
given for the frequencies used for sit. The error values introduced by 
going from eqs. (5) to (6) are also given in the Appendix for the same 
frequencies. 
Suppose now that a frequency, one of possible two, is transmitted. 
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If on the receiving end we make sure that the sampling frequency is 
large enough compared to the two possible frequencies, we ensure the 
uniqueness of the correspondence between these frequencies and the 
parabola defined by eq. (3). With this assurance, we can sample the 
received signal, compute Co, &, C2 according to eq. (4), and then use 
the values C1/C0 and C2/C0 to decide which frequency was initially 
sent. 

III. ADDITIVE NOISE 

In the previous section, frequency detection using correlation func- 
tions was described. As long as the received signal contains nothing 
but the pure frequency originally sent, the problem is clearly deter- 
ministic. However, in practice there are various degradations that 
affect the received signal. In this section, we are interested in evalu- 
ating the performance of the correlation detector in the presence of 
additive white noise. 

Thus, let the received signal be 

r(t ) - A sin 2irf k t + n(t) h - 1, 2, 

where A corresponds to the signal-to-noise ratio and n(t) is a normal- 
ized white Gaussian noise. The sampled values are 

r, = A sin k i + /i„ (7) 

where 



ri " r [w} m ' n w} ek ' 2v ¥: 



k - 1, 2, i m 0, 1, 2, ■ . . , N - 1, 

and m are independent identically distributed (iid) variables with zero 
mean and variance one. 
Substitution of eq. (7) into eq. (4) results in 

N-n-l 

£ j4 2 sin i6k sin(i + n)6k 



Cn N 



i-O 

N-n-l N-n-l 

+ S A[n i+ nsin.id + n,sin(i + n)0] + £ ntm+n 

i-0 i-O 



(8) 



Since eq. (8) contains random variables, Co, Ci, C2, and hence, the 
ratios C1/C0 and C2/C0 are also random variables with certain density 
functions. Once these density functions are known the detection proc- 
ess becomes a standard problem described in detail in various text- 
books on detection theory (see for example Ref. 2). We use the 
likelihood ratio test. To compute the threshold, we assume that the 
frequencies to be detected have equal a priori probabilities. The costs 
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are assumed to be zero and one for correct detection and error, 
respectively. With the knowledge of the density functions and the 
threshold, we can evaluate the performance of the detection process 
by computing the error probability. 

The first step then, is to develop the expressions for the density 
functions of Ci/Co and C2/C0. However, in view of the complexity of 
eq. (8), rather than attempt to develop an exact expression, we will 
make use of a version of the Central Limit Theorem (Theorem 4.2.5 in 
Ref. 3) and develop approximate expressions for these density func- 
tions. 

Let us now denote 

A 2 N-n-l 

Sn = -tt £ sin idk sin(i + n)0k. 



N £> 

A N ~ n ~ 1 
Yni - -jr: 2 [»«+«»in iO k + n«sin(i + n)B k ]. 

1 N-n-l 
Yn2 ™ "r= X /lifli+n- 

^V ,-_o 



(9a) 
(9b) 
(9c) 



Then, 



C n — S n + Y n l + Y n 2- 



(10) 



With some manipulation, we can get more convenient expressions 
for S n and YjJi: 



S n - — cos nB k 



— jr= — COS(iV - 1)0* . - 

iV AT sin 0* 

sin n#* 



+ -_tfcos(iV-l)& . 

2 sin^ 



N-n-l 



Yni = 2 — cos n0* X n.sin t0* 



AT 
+ 



A I"" -1 w-1 "I 

— £ n,-sin(i + n)0* + J /i,sin(i - n)0*. 

iV .1-0 N-n 1 



Now, since N » n we can write 



S* = — cos nBk 



i * /A7 ,v/, sin AW* 
1 - — cos(AT - 1)0* . 
iv sm 0* 



and 



N-l 



Yni = 2 — cos «0* £ /i,sin i0*, 

A/ i-0 
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or as 



We get 



Denoting 



s§=^ 



1 ,, T >„ sin N8 k 
N sin 6k 



yg, 



A N_1 
= 2 T? S Wisin Wk. 
Jy 1=0 



S* = Socos nO k 
Yn\ = Yoicos nO k . 



v k — 

X\ — 



(11a) 
(lib) 

(12a) 
(12b) 



Co* 



X k 2 = 



and using eqs. (10) and (12) we get 



Co* 



X 



k [S* + y*i]cos 8 k + y* 2 

i = — ~ t . „«. . , 



s§ + yg 



+ Y* 



02 



, _ [s* + yg^cos 20* + y 

So + Yoi + Y02 



(13a) 
(13b) 



We observe now that x * and x$ are functions of the random variables 
Yoi, Y02, Y12, Y22, each one of which is a linear combination of iid 
random variables. The commonly used version of the Central Limit 
Theorem can be applied to conclude that the above four variables 
have Gaussian limiting distributions. This in turn enables us to use 
Theorem 4.2.5 in Ref. 3 to find the distributions ofx* and**. 

The first step is to compute the means and variances of Yoi, 
Y02, Y*2, and Y22. Using the fact that the nt'a are iid Gaussian random 
variables with zero mean and variance one and using eqs. (9c) and 
(lib) we can readily verify that if 



Y* = 



"Y6\~ 


1 02 


Y*2 


1 22 



then 



E{Y k ) = 



1296 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1981 



and 



cov(Y*) = £ 



[Y* - E{Y k )][Y k - E{Y k }]' 



±s 


5o 

2 
N 























1 

N 














1 

N 



.(14) 



As in Ref. 3 denote vector b: 



b = E[Y k ] = 



From eq. (14), the entries of Y k , being Gaussian independent random 
variables, are jointly Gaussian, as is \/N(Y k — b) with zero mean and 
covariance matrix 









N S 


5o 













o |oo 






T 


= N 






N° 

•i 




Now, let 












x"(Y k ) = 


Xi 




and 












<t>u ■ 


i axf 


b, 




jNdY* 


so by eq. (13) 








$ = 


1 




cos Ok 
— S*cos 




cos 26k 
dk -Sgcos 2d k 


>/AT[S§ + 


I] 2 






(15) 



All this establishes the preconditions for Theorem 4.2.5 in Ref. 3, 
which then states that the vector [x k (Y k ) - x k (b)] has a Gaussian 
limiting distribution with zero mean and covariance matrix 
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R = <j>'T<t>. 
Since TV in our case is quite large, we may write 

p x "(x k ) =Px\f k (x\f k ) = 



(16) 



2ir\R k \" 2 



exp 



--[x-x k (b)]'Rk\x-x k (b)] 



, (17) 



where 



Rk = 



and 



N(S§ + l) 4 



(S k ) 2 [2 cos 2 6 k + 1] + 2S§[2 cos 2 0* + 1] + 1, 
2S§cos 0*cos 20 k (S% + 2), 

2S§cos 6 k cos 20* (S5 + 2), 
(S§) 2 [2 cos 2 20* + 1] + 2S£[2 cos 2 20* + 1] + 1 



Sg 



E{x * )=x * {b) =-^- 



cos Ok 
cos 20* 



From eqs. (11a) and (17), we can readily observe the way the signal- 
to-noise ratio, A, affects the density function. If A gets very large, S* 
becomes very large; then the mean value of x k approaches the parabola 
and the entries of Rk become very small — namely, we approach the 
deterministic case described in the previous section. On the other 
hand, if A approaches zero, So goes to zero as well and the mean value 
approaches the origin. This is understandable since as A gets smaller, 
the effect of the noise gets larger, dominating the signal; and, being a 
white noise, the cross correlation functions C\ and d approach zero. 

In Figs. 2 and 3, we see the effect of A as described above for two 
pairs of frequencies that were selected for the sit. The sampling 
frequency is 4000 Hz. It should be noted that the mean values for each 
frequency are on a straight line as is obvious from eq. (17). In the 
figures, for every A, one equal-probability contour is drawn around the 
mean and the fact that these contours get smaller as A gets larger is 
expected since the entries of Rk are getting smaller as was pointed out 
earlier. 

Once we have eq. (17) the detection is straightforward. With a priori 
probabilities and costs as described earlier, the measured data is used 
to compute the point in the (Ci/Co, C 2 /Co) plane and test which 
conditional density has higher value at this point. Then decide on the 
corresponding frequency as the one that was sent. More details of this 
procedure are described in Ref. 2. However, using the expressions we 
have for the density functions the process can be somewhat simplified. 
From eq. (17) and Ref. 2, 
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Fig. 2 — Dependence of the density functions P(x \ f) on signal-to-noise ratio for 904.5 
and 985.4 Hz (dashed lines are the equal likelihood points). 



Px\fSx\fi)zZp x \f 2 (x\f 2 ), 
ft 

which means that if the inequality holds in one direction, we decide on 
f\, and if in the other, f 2 . This is equivalent to 

In |fl 2 1 - ln|fl,| + [x- x\b)JRt\x - x 2 (b)] 

h 
-[x-x l {b)]'RT\x-x\b)]*S.O, (18) 

h 

which with an equality sign is a line in the (Ci/Co, C 2 /Co) plane. The 
dotted lines in Figs. 2 and 3 correspond to the above-mentioned line. 
On one side of these lines one density function has higher values, on 
the other side the other density function. The detection then consists 
of deciding on which side of this line the measured point falls. 
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Fig. 3 — Dependence of the density functions p(x| /") on signal-to-noise ratio for 1356.8 
and 1440.2 Hz (dashed lines are the equal likelihood points). 



To evaluate the performance of this detector, we compute the error 
probability. Let B k , k = 1, 2 be the part of the plane in which 
Px\f h (x\fk) has the larger value. Then the probability of error will be 



Pr(e)=- 



= \\ j P*\f 1 (x\f 1 )dx + j p X | /2 (x|/- 2 )dx|. 



(19) 



Figure 4 shows the error probability as a function of A for, again, 
the two pairs of frequencies of interest in the sits 904.5 Hz, 985.4 Hz, 
and 1356.8 Hz, 1440.2 Hz, and sampling frequency of 4000 Hz. Since, 
in general, signals transferred by the telephone network result in 
signal-to-noise ratios higher than 8 dB, the results here are encouraging 
for both frequency pairs. It is interesting to note that the higher 
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frequency pair results in a better performance. Observing Figs. 2 and 
3, one can see the reason for this difference in performance; the density 
functions for every signal-to-noise ratio are better separated in the 
higher pair. It turns out, that for every choice of a sampling frequency 
and difference in value, some pairs of frequencies — and not necessarily 
the lower valued frequencies — are more detectable than others. 

IV. ADDITIVE NOISE AND FREQUENCY SHIFT 

The use of announcement machines as tone generators will introduce 
two primary types of degradation, frequency flutter and frequency 
shift. In this section, we attempt to analyze the effects of the frequency 
shift with additive noise on the performance of the correlation detector. 
It is assumed that the flutter effect is eliminated by averaging a 
number of observations of each received tone. 

The complexity of the expressions involved in the analysis here 
makes closed-form results very difficult if possible at all. For this 
reason, digital computer calculations were used as the main tool in the 
analysis. 

Introducing the frequency shift, we get the expression for the re- 
ceived signal 



r(t) = A sin 2tt 



(l + n d )f 



t + n(t), 



0.4 



0.1 









\ 904.5 Hz 




\ --- "^"985.4 Hz 


_ 1356.8 Hz y \ 




1440.2 Hz 




J ! 


1 |V^ | \j | 



2 4 6 

A IN DECIBELS 



Fig. 4 — Probability of error when no frequency shift is considered. 
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where n d is assumed to be a random variable with extended beta 
distribution. This particular distribution is general enough to include 
many possibilities and agrees with the physical properties of n d 
(namely —1 <n d < 1). So /id's assumed density function is 

(B(l + n d )"- 1 (l-n d ) fi - 1 for |/i d |<l 
Pn d (n d )-< Q elsewhere ' 



where 

or+,8+1 

I 

B = 



T(a)TiP) 



and a,/3 > 1 are the two distribution parameters (in our computations 
we chose a = /? = 10 which fits reasonably the little data available for 
/i d ). 

The presence of this additional degradation causes the expressions 
developed in the previous section to be conditional on /id. This means 
that now, rather than having an expression far p x \f k (' | '), we have an 
expression foTp x \n d ,f k (' | ' ) or using eq. (17) we may write 



P'\n d ,f k (x\n d ,fk) = 



exp 



-^[x-E{x k }YRk 1 [x-E{x k }] 



, (20) 



where the expressions for R k , E{x k ), So are as before [see (11a) and 

(17)]andfc = 2*<I±^. 

Since for the likelihood ratio testp*|/ A (" | ') is needed, we can proceed 
to compute it using the relation 



■I 



Px\f k (x\fk) = Px\n d ,f k (x\nd,fk)Pn d (n d )dn d . 

With this and eq. (19), the performance of the detector can be evalu- 
ated for this case where both additive noise and frequency shift are 
present. In Fig. 5 the probabilities of error in detection as a function of 
the signal-to-noise ratio are presented. Comparing these results to Fig. 
4 makes it clear that the performance of the detector deteriorates very 
significantly when frequency shift is present. Even for a very high 
signal-to-noise ratio the frequency shift induces considerable error. 
The effect of the frequency shift alone, which provides lower bounds 
on the error probabilities in Fig. 5, can be calculated as follows. When 
A gets very large the received signal becomes a pure tone with a shifted 
frequency. This, however, implies that in this case it will be sufficient 
to consider only x\ = G/Co for the detection. Since x\ and n d are 
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10 



Fig. 5 — Probability of error when frequency shift is considered but no calibration 
frequency is used. 



related through 



Xi = cos 27T(1 + rid) tt, 



the knowledge of p„ d ( ) makes the calculation ofp Xt \f ki ('\) for each 
frequency /"*, straightforward. In Fig. 6 the density functions for the 
two pairs of frequencies are drawn and the thresholds for the detection 
in this case are the intersections of the density functions (also pointed 
out in the figure). 

The technique to overcome the frequency shift is based on using the 
third tone for calibration in the detection of the first two tones. This 
means that a fixed tone is sent, processed in the receiving end, and the 
knowledge of its exact value and the corresponding measurements can 
be used to improve the detection of the first two tones. 

To be more precise, let f 3 be the frequency of the third tone and 
x 3 = [Ci/Co, Ci/Co] computed from the corresponding measured data. 
Then through eq. (20) we know that 



Px3|„ d / 3 U 3 |n d ,/ 3 ) = 



2tt\R 3 \ 1/2 

exp 



- -[x 3 - Eix'WRrtx 3 - E{x 3 }] 
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f 2 = 985.4 Hz 



— f, = 905.4 Hz 




-1 -0.5 0.5 1 

FOR THE LOWER INTERVAL FREQUENCIES 

(a) 




1356.8 Hz 



-0.5 0.5 1 

FOR THE HIGHER INTERVAL FREQUENCIES 

(b) 

Fig. 6 — Density functions required for detection when frequency shift is considered 
but the additive noise is ignored (A — » oo). 

where the expressions for E{x 3 } and R 3 are as in eqs. (11a) and (17) 
with /~3 substituted for /*. Using this we can improve on our knowledge 
of the statistics of n d by computing p nj \ xV 8 ( • I • > • ) through the relation- 
ship 

r l 3 r-, P*»l«*fe(* 3 1 M * f3)Pn d (n d ) 

Pn d \x*J 3 \n d \X , f 3 \ = — j • 

P* 3 I n d / 3 [x 3 1 n d , f3\Pn d (n d )dn d 

This improved data on n d can then be used to calculate p x \ x \f 3 j k 
(.|.,.,.) 

P* |*Vm(* I * 3 > A] ■ I P*l **(* I "* foP** I *V 3 l>d I * 3 » fs\dn d , 
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which in turn will be used for the detection process, namely 

h 
Px I *»/,/,[* I x 3 , fa, /i] ^ Px \x\ulx | x 3 , f 3 , f 2 ]. 

h 



(21) 



In Figs. 7 and 8, we present — for this improved detection process 
with 1758.5 Hz as the calibration frequency — the error probabilities as 
a function of the signal-to-noise ratio. Note that the sampling fre- 
quency 4000 Hz is larger than 2 X 1758.5. Comparing this to the results 
without the calibration frequency, we observe considerable improve- 
ment, whereas the results computed with no frequency shift present 
provide a lower bound on error probabilities (or an upper bound on 
the improved detector's performance). 

The case when A — > oo (i.e., when the additive noise becomes 
negligible) is again of special interest but very simple. Since now with 
the knowledge of both x\ and f 3 the shift can be exactly computed, 

h 
xi = cos 27r(l + rid) -=r 

#8 



or 



n d = -z-^r cos ^i - 1, 

27T/ 3 



0.6 



\ 




WITH SHIFT AND NO 
\ CALIBRATION FREQUENCY 



\ 



\ 
\ 



\ 



\ 



\ 



WITH SHIFT AND 
'CALIBRATION FREQUENCY 



-6 -4 -2 



2 4 6 8 10 12 14 16 18 20 

A IN DECIBELS 



Fig. 7 — Error probabilities with frequency shift and calibration frequency — lower 
interval frequencies. 
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Fig. 8— Error probabilities with frequency shift and calibration frequency— higher 
interval frequencies. 



and the detection of the first two tones becomes deterministic. The 
only source of problems in this case is the question of uniqueness of 
cos"'xi. This can be overcome by considering both cos -1 *? and 2n — 
cos -1 *? and with probability one, one of them will recover one of the 
two possible frequencies. 



V. A SUGGESTED PRACTICAL CALIBRATION ALGORITHM 

In the previous section, we described how the use of a calibration 
frequency can theoretically improve the detection of tones that are 
affected by frequency shift and additive noise. However, practical 
computation of the density functions required in eq. (21), or the 
separating line (where the two density functions are equal) is impos- 
sible. Hence, an algorithm is required that is both compatible to the 
theoretical exact density function separation and simple enough to be 
practically implemented. Such an algorithm is presented and its per- 
formance evaluated. 

Let us first review the available data to be used for the detection. 
The received sequence of three tones is sampled and for each tone the 
correlation functions are calculated repeatedly and averaged to elimi- 
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nate frequency flutter effect. Then the cross-correlation functions are 
normalized to give the three vectors 



c\ 




c\ 




Cl 


Ch 




cl 




C 3 o 


CI 


x 2 = 


cl 


x 3 = 


cl 


FK 




L c§J 




F*°. 



corresponding to the three tones. Whereas the frequencies resulting in 
x 1 and x 2 , respectively, are not known (in each case they can be either 
one of two possible frequencies), the third one is known to result from 
1758.8 Hz. The idea is to use this knowledge to get an estimate of the 
frequency shift to help in the decision process of the first two tones. 
The suggested algorithm (see Fig. 9) is as follows: 
(i) Draw a line from the origin through x 3 ; it will intersect the 
parabola at 



1758.5 




y( 1356.8 + 1440.21- 



UPDATED 
MIDFREQUENCY 2 



UPDATED 
MIDFREQUENCY 1 



Fig. 9 — Geometric interpretation of suggested algorithm. 
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x 3 = 



Xi 




1 


. X 2. 




V 3 

X2 

. Xl . 



Xi, 



where 



V 3 — 

Xl — 



xl + V[*f] 2 + 8[xlf 
2x\ 



(ii) Use x i to compute the values 



a\ = cos 9, 
a l 2 = cos 20, 



ft}'" 1 ' 2 ' 



(22) 



(23) 



where 



„ 904.5 + 985.4 ... 

ft ' 2x1758.5 cos *■ 

= 0.5374 cos _1 f ? 



(24a) 



and 



e 2 = 



1356.8 + 1440.2 
2 X 1758.5 

= 0.7953 cos _1 f i. 



cos 1 xi 



(24b) 



(Hi) Draw the lines from the origin to the points (a\, ah): If x' is 
counterclockwise away from the corresponding line, the ith tone is of 
the lower frequency, and if it is clockwise away, it is of the higher 
frequency. 

The motivation behind this algorithm is quite simple. We have 
observed earlier that the additive noise effect is to shift the mean value 
of the pair (Ci/Co, C2/C0) towards the origin, whereas the frequency 
shift causes this pair to move along the parabola. The first step in the 
algorithm can be viewed then as isolation of the effect of the frequency 
shift. The point x 3 on the parabola is regarded as the result of the 
original frequency 1758.5 Hz, together with some shift that can now be 
estimated using the relationship 



X 1 = cos 2-n- 



1758.5 
4000 



(1 + n d ) 



or 



4000 



n d = 



,-l£3 



277 1758.5 



cos 'jci - 1. 
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This estimate is then used to update the midfrequencies for each tone, 
16(904.5 + 985.4) (1 + n d ) for the first and 16(1856.8 + 1440.2) (1 + n d ) 
for the second. The lines that connect the origin with the points 



cos- 



7t(904.5 + 985.4) 
4000 



(1 + n d ) 



and 



2tt(904.5 + 985.4) ,„ 

cos iooo (1 + nd) 



tt(1356.8 + 1440.2) . A v 

cos iboo {1 + nd) 

2?r( 1356.8 + 1440.2) 



cos 



4000 



(1 + n d ) 



provide the threshold lines for the detection of the frequencies of the 
respective tones. In Fig. 9 the algorithm is described geometrically. 

In Figs. 10 and 11 the performance of a detector using this algorithm 
is compared to the performance when exact separation is assumed. It 
is quite clear that the use of the algorithm results in a performance 
that is very close to the theoretical one. 
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2 4 6 
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Fig. 10 — Comparison of error probabilities for suggested detection algorithm to the 
exact separation — lower interval frequencies. 
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Fig. 11 — Comparison of error probabilities for suggested detection algorithm to the 
exact separation — higher interval frequencies. 

There are two difficulties in the described algorithm that will affect 
its performance. The first is because positive shifts above approxi- 
mately 14 percent will result in frequencies higher than the critical 
one — F s /2 (see Appendix) — for the third tone, and the points on the 
parabola are no longer uniquely related to their corresponding fre- 
quencies. This difficulty is inherent to the correlation function ap- 
proach and can be eliminated only if a significantly higher sampling 
frequency is chosen (approximately 7000 Hz). However, if we assume 
that the shift is always less than the critical one, even for the proposed 
sampling frequency, the effect does not seem to be significant since in 
the model we have chosen for the frequency shift the probability of 
having shifts higher than 14 percent is quite low. The second difficulty, 
which is inherent in the proposed algorithm, arises when the resultant 
x\ is less than —1. In this case, we propose simply to take it equal to 
—1 and again argue that the probability of this happening is very low 
even for small signal-to-noise ratios. Altogether, both difficulties, if 
treated as is suggested above, do not seem to affect the performance 
of the detection algorithm. 

VI. CONCLUSION 

In this paper, we addressed some of the problems in detecting 
recorded announcements encoded via the special information tones 
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(sit). In particular, we discussed problems that arise when additive 
noise is present. We have assumed that the frequency flutter effects 
are eliminated by averaging several observations, and investigated in 
detail only the additive noise and frequency shift effects. 

Our results support the conclusion that by properly using the infor- 
mation on the frequency shift, its effects can be made almost negligible 
and under these conditions high-performance sit detection can be 
achieved. 

The performance evaluations presented here make explicit use of a 
certain assumed model for the noise and frequency shift; however, the 
detection algorithm, which is proposed in Section V, is independent of 
any such assumptions. The performance of this algorithm is compa- 
rable to that theoretically achievable, and thus this algorithm is 
proposed for implementation in the sit classification process. 
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APPENDIX 

Approximations Used in Computing the Correlation Functions by Sampled 



Data. 




















(Hz) 


e 

(rad) 


CCo 


CCi 


CC 2 


Co 


c, 


c 2 


E t 


E 2 


904.5 
985.4 
1356.8 
1440.2 
1758.5 


1.42 
1.55 
2.13 
2.26 
2.76 


0.5 

0.499 

0.4993 

0.4993 

0.5004 


0.0747 

0.0114 

-0.2654 

-0.3184 

-0.4649 


-0.4778 
-0.4985 
-0.2171 
-0.0932 
-0.3632 


0.4971 

0.4982 

0.499 

0.4997 

0.496 


0.0743 

0.0114 

-0.2653 

-0.3186 

-0.4608 


-0.4749 
-0.4976 
-0.217 
-0.0933 
0.36 


0.0001 
0.0008 
0.0008 
0.0003 
-0.0019 


0.00004 
0.00003 
-0.0008 
-0.0004 
0.0034 



Continuously computed correlation functions [see eq. (2)]: 

„„ 1 , „, T sin Airft 
CC n = - cos(nd) 1 - 



2 L 4 *# 

Approximated correlation functions [see eq. (6)]: 

C„ = I cos(nd) { 1 - i- cos[(N - 1)0] sm ^ I 
2 [ N sin 6 J 

Differences between discretely computed correlation functions [see eq. (5)] 
and their approximated values. 

E " " lL co *U N - W] ^7T coa(Nd). 
2N sin 

[Note: F a = 4000 Hz; N = 167; T = Vh seconds and 6 = 2tt (f/F.)]. 
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